AITopics | anonymization system

Collaborating Authors

anonymization system

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

SegReConcat: A Data Augmentation Method for Voice Anonymization Attack

Arefeen, Ridwan, Miao, Xiaoxiao, Tong, Rong, Ng, Aik Beng, See, Simon

arXiv.org Artificial IntelligenceAug-27-2025

Anonymization of voice seeks to conceal the identity of the speaker while maintaining the utility of speech data. However, residual speaker cues often persist, which pose privacy risks. We propose SegReConcat, a data augmentation method for attacker-side enhancement of automatic speaker verification systems. SegReConcat segments anonymized speech at the word level, rearranges segments using random or similarity-based strategies to disrupt long-term contextual cues, and concatenates them with the original utterance, allowing an attacker to learn source speaker traits from multiple perspectives. The proposed method has been evaluated in the VoicePrivacy Attacker Challenge 2024 framework across seven anonymization systems, SegReConcat improves de-anonymization on five out of seven systems.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2508.18907

Country: Asia (0.28)

Genre: Research Report (0.64)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

You Are What You Say: Exploiting Linguistic Content for VoicePrivacy Attacks

Gaznepoglu, Ünal Ege, Leschanowsky, Anna, Aloradi, Ahmad, Singh, Prachi, Tenbrinck, Daniel, Habets, Emanuël A. P., Peters, Nils

arXiv.org Artificial IntelligenceJun-12-2025

Speaker anonymization systems hide the identity of speakers while preserving other information such as linguistic content and emotions. To evaluate their privacy benefits, attacks in the form of automatic speaker verification (ASV) systems are employed. In this study, we assess the impact of intra-speaker linguistic content similarity in the attacker training and evaluation datasets, by adapting BERT, a language model, as an ASV system. On the VoicePrivacy Attacker Challenge datasets, our method achieves a mean equal error rate (EER) of 35%, with certain speakers attaining EERs as low as 2%, based solely on the textual content of their utterances. Our explainability study reveals that the system decisions are linked to semantically similar keywords within utterances, stemming from how LibriSpeech is curated. Our study suggests reworking the VoicePrivacy datasets to ensure a fair and unbiased evaluation and challenge the reliance on global EER for privacy evaluations.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2506.09521

Country: Europe (0.47)

Genre: Research Report > New Finding (0.34)

Industry: Information Technology > Security & Privacy (0.30)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.67)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.34)

Add feedback

Children's Voice Privacy: First Steps And Emerging Challenges

Kulkarni, Ajinkya, Teixeira, Francisco, Hermann, Enno, Rolland, Thomas, Trancoso, Isabel, Doss, Mathew Magimai

arXiv.org Artificial IntelligenceJun-5-2025

Children are one of the most under-represented groups in speech technologies, as well as one of the most vulnerable in terms of privacy. Despite this, anonymization techniques targeting this population have received little attention. In this study, we seek to bridge this gap, and establish a baseline for the use of voice anonymization techniques designed for adult speech when applied to children's voices. Such an evaluation is essential, as children's speech presents a distinct set of challenges when compared to that of adults. This study comprises three children's datasets, six anonymization methods, and objective and subjective utility metrics for evaluation. Our results show that existing systems for adults are still able to protect children's voice privacy, but suffer from much higher utility degradation. In addition, our subjective study displays the challenges of automatic evaluation methods for speech quality in children's speech, highlighting the need for further research.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2506.001

Country: Europe > Portugal (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Security & Privacy (0.94)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.69)
Information Technology > Artificial Intelligence > Natural Language (0.68)

Add feedback

The First VoicePrivacy Attacker Challenge

Tomashenko, Natalia, Miao, Xiaoxiao, Vincent, Emmanuel, Yamagishi, Junichi

arXiv.org Artificial IntelligenceApr-22-2025

Training, development, and evaluation datasets were provided along with a baseline attacker . Participants developed their attacker systems in the form of automatic speaker verification systems and submitted their scores on the development and evaluation data. The best attacker systems reduced the equal error rate (EER) by 25-44% relative w.r .t. the baseline. Index T erms --V oice privacy, voice anonymization, attacker system, automatic speaker verification I. C ONTEXT Speech conveys a lot of personal data, e.g., age and gender, health, geographical or ethnic origin, and socio-economic status. Formed in 2020, the V oicePrivacy initiative [1] promotes privacy enhancing solutions for speech technology via a series of benchmarking challenges.

anonymization system, artificial intelligence, machine learning, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/ICASSP49660.2025.10888513

2504.14183

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.15)

Genre: Research Report (0.40)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.92)
Information Technology > Artificial Intelligence > Speech > Acoustic Processing (0.57)

Add feedback

Analysis of Speech Temporal Dynamics in the Context of Speaker Verification and Voice Anonymization

Tomashenko, Natalia, Vincent, Emmanuel, Tommasi, Marc

arXiv.org Artificial IntelligenceDec-22-2024

Abstract--In this paper, we investigate the impact of speech methods use large-scale pre-trained models for extracting specific temporal dynamics in application to automatic speaker verification attributes and provide better content and privacy preservation than and speaker voice anonymization tasks. We propose several signal processing based methods. The diversity of approaches is metrics to perform automatic speaker verification based only illustrated by the VoicePrivacy 2024 Challenge [10], which provided on phoneme durations. Experimental results demonstrate that six baseline anonymization systems, namely anonymization using x-phoneme durations leak some speaker information and can reveal vectors and a neural source-filter model [6], [11], signal processing speaker identity from both original and anonymized speech. While specific studies have been dedicated to speaker information carried by pitch [5], [6], [8], the impact of speech temporal dynamics on speaker verification and re-identification has been overlooked.

anonymization, artificial intelligence, speech recognition, (15 more...)

arXiv.org Artificial Intelligence

2412.17164

Country:

Europe > France > Grand Est > Meurthe-et-Moselle > Nancy (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Europe > France > Hauts-de-France > Nord > Lille (0.04)
Asia (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Information Technology > Security & Privacy (0.69)

Technology: Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)

Add feedback

The First VoicePrivacy Attacker Challenge Evaluation Plan

Tomashenko, Natalia, Miao, Xiaoxiao, Vincent, Emmanuel, Yamagishi, Junichi

arXiv.org Artificial IntelligenceOct-21-2024

The First VoicePrivacy Attacker Challenge is a new kind of challenge organized as part of the VoicePrivacy initiative and supported by ICASSP 2025 as the SP Grand Challenge It focuses on developing attacker systems against voice anonymization, which will be evaluated against a set of anonymization systems submitted to the VoicePrivacy 2024 Challenge. Training, development, and evaluation datasets are provided along with a baseline attacker system. Participants shall develop their attacker systems in the form of automatic speaker verification systems and submit their scores on the development and evaluation data to the organizers. To do so, they can use any additional training data and models, provided that they are openly available and declared before the specified deadline. The metric for evaluation is equal error rate (EER). Results will be presented at the ICASSP 2025 special session to which 5 selected top-ranked participants will be invited to submit and present their challenge systems.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2410.07428

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Asia > Singapore (0.04)
(5 more...)

Genre: Research Report (0.40)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.94)
(2 more...)

Add feedback

Privacy versus Emotion Preservation Trade-offs in Emotion-Preserving Speaker Anonymization

Cai, Zexin, Xinyuan, Henry Li, Garg, Ashi, García-Perera, Leibny Paola, Duh, Kevin, Khudanpur, Sanjeev, Andrews, Nicholas, Wiesner, Matthew

arXiv.org Artificial IntelligenceSep-5-2024

Advances in speech technology now allow unprecedented access to personally identifiable information through speech. To protect such information, the differential privacy field has explored ways to anonymize speech while preserving its utility, including linguistic and paralinguistic aspects. However, anonymizing speech while maintaining emotional state remains challenging. We explore this problem in the context of the VoicePrivacy 2024 challenge. Specifically, we developed various speaker anonymization pipelines and find that approaches either excel at anonymization or preserving emotion state, but not both simultaneously. Achieving both would require an in-domain emotion recognizer. Additionally, we found that it is feasible to train a semi-effective speaker verification system using only emotion representations, demonstrating the challenge of separating these two modalities.

emotion recognizer, speech, utterance, (15 more...)

arXiv.org Artificial Intelligence

2409.03655

Country:

Asia > Macao (0.04)
Asia > China (0.04)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.90)

Add feedback

Probing the Feasibility of Multilingual Speaker Anonymization

Meyer, Sarina, Lux, Florian, Vu, Ngoc Thang

arXiv.org Artificial IntelligenceJul-3-2024

In speaker anonymization, speech recordings are modified in a way that the identity of the speaker remains hidden. While this technology could help to protect the privacy of individuals around the globe, current research restricts this by focusing almost exclusively on English data. In this study, we extend a state-of-the-art anonymization system to nine languages by transforming language-dependent components to their multilingual counterparts. Experiments testing the robustness of the anonymized speech against privacy attacks and speech deterioration show an overall success of this system for all languages. The results suggest that speaker embeddings trained on English data can be applied across languages, and that the anonymization performance for a language is mainly affected by the quality of the speech synthesis component used for it.

anonymization, proc, speech, (14 more...)

arXiv.org Artificial Intelligence

2407.02937

Country: Europe > Germany > Baden-Württemberg > Stuttgart Region > Stuttgart (0.04)

Genre: Research Report > New Finding (0.68)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

The VoicePrivacy 2024 Challenge Evaluation Plan

Tomashenko, Natalia, Miao, Xiaoxiao, Champion, Pierre, Meyer, Sarina, Wang, Xin, Vincent, Emmanuel, Panariello, Michele, Evans, Nicholas, Yamagishi, Junichi, Todisco, Massimiliano

arXiv.org Artificial IntelligenceJun-12-2024

The task of the challenge is to develop a voice anonymization system for speech data which conceals the speaker's voice identity while protecting linguistic content and emotional states. The organizers provide development and evaluation datasets and evaluation scripts, as well as baseline anonymization systems and a list of training resources formed on the basis of the participants' requests. Participants apply their developed anonymization systems, run evaluation scripts and submit evaluation results and anonymized speech data to the organizers. Results will be presented at a workshop held in conjunction with Interspeech 2024 to which all participants are invited to present their challenge systems and to submit additional workshop papers.

anonymization system, evaluation, international conference, (13 more...)

arXiv.org Artificial Intelligence

2404.02677

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Europe > Greece (0.04)
Europe > France (0.04)
(3 more...)

Genre:

Research Report (0.50)
Instructional Material (0.34)

Industry:

Information Technology > Security & Privacy (1.00)
Law (0.68)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Evaluation of Speaker Anonymization on Emotional Speech

Nourtel, Hubert, Champion, Pierre, Jouvet, Denis, Larcher, Anthony, Tahon, Marie

arXiv.org Artificial IntelligenceApr-15-2023

Speech data carries a range of personal information, such as the speaker's identity and emotional state. These attributes can be used for malicious purposes. With the development of virtual assistants, a new generation of privacy threats has emerged. Current studies have addressed the topic of preserving speech privacy. One of them, the VoicePrivacy initiative aims to promote the development of privacy preservation tools for speech technology. The task selected for the VoicePrivacy 2020 Challenge (VPC) is about speaker anonymization. The goal is to hide the source speaker's identity while preserving the linguistic information. The baseline of the VPC makes use of a voice conversion. This paper studies the impact of the speaker anonymization baseline system of the VPC on emotional information present in speech utterances. Evaluation is performed following the VPC rules regarding the attackers' knowledge about the anonymization system. Our results show that the VPC baseline system does not suppress speakers' emotions against informed attackers. When comparing anonymized speech to original speech, the emotion recognition performance is degraded by 15\% relative to IEMOCAP data, similar to the degradation observed for automatic speech recognition used to evaluate the preservation of the linguistic information.

artificial intelligence, machine learning, speech, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.21437/SPSC.2021-13

2305.01759

Country:

Europe > United Kingdom > England > Greater London > London (0.04)
Europe > Slovenia (0.04)
Europe > Portugal > Lisbon > Lisbon (0.04)
(2 more...)

Genre: Research Report > New Finding (0.68)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback